Bush-Mosteller learning for a zero-sum repeated game with random pay-offs

نویسندگان

  • Alexander S. Poznyak
  • Kaddour Najim
چکیده

This paper deals with the design and analysis of a modijied version of the BushMosteller reiqfimement scheme applied by partners in a zero-sum repeuted grme with random pay-offs. The suggested study is based on the learning automata paradigm and a limiting average reward criterion is tackled to analyse the arising Nash equilibrium. No information concerning the distribution qf the pay-off is a priori available. The noveltjl of the suggested adaptive strategj3 is related to the incorporation qf a 'normalization procedure' into the standard Bu.sh-Mostcller scheme to provide a po.r.sibilitj, to operate not only with binary but also ~ , i t h unj3 hounded rewards of a stochastic nature. The analysis of the convergerre (adaptation) us well as the convergence rate (rute of adaptation) are presented and the optimal design parumetcrs of this adaptive procedure are derived. The obtained adaptation rute turns out to be of o(n-'I3).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A More General Model of Cooperation Based on Reinforcement Learning: Alignment and Integration of the Bush-mosteller and the Stochastic Collusion and the Power Law of Learning: Aligning and Integrating the Bush-mosteller and the Roth-erev Reinforcement Learning Models of Cooperation

Analytical game theory has developed the Nash equilibrium as theoretical tool for the analysis of cooperation and conflicts in interdependent decision making. Indeterminacy and demanding rationality assumptions of the Nash equilibrium have led cognitive game theorists to explore learning-theoretic models of behavior. Two prominent examples are the Bush-Mosteller stochastic learning model and th...

متن کامل

Learning through reinforcement for N-person repeated constrained games

The design and analysis of an adaptive strategy for N-person averaged constrained stochastic repeated game are addressed. Each player is modeled by a stochastic variable-structure learning automaton. Some constraints are imposed on some functions of the probabilities governing the selection of the player's actions. After each stage, the payoff to each player as well as the constraints are rando...

متن کامل

Multiple attribute decision making with triangular intuitionistic fuzzy numbers based on zero-sum game approach

For many decision problems with uncertainty, triangular intuitionistic fuzzy number is a useful tool in expressing ill-known quantities. This paper develops a novel decision method based on zero-sum game for multiple attribute decision making problems where the attribute values take the form of triangular intuitionistic fuzzy numbers and the attribute weights are unknown. First, a new value ind...

متن کامل

A TRANSITION FROM TWO-PERSON ZERO-SUM GAMES TO COOPERATIVE GAMES WITH FUZZY PAYOFFS

In this paper, we deal with games with fuzzy payoffs. We proved that players who are playing a zero-sum game with fuzzy payoffs against Nature are able to increase their joint payoff, and hence their individual payoffs by cooperating. It is shown that, a cooperative game with the fuzzy characteristic function can be constructed via the optimal game values of the zero-sum games with fuzzy payoff...

متن کامل

Reinforcement learning account of network reciprocity

Evolutionary game theory predicts that cooperation in social dilemma games is promoted when agents are connected as a network. However, when networks are fixed over time, humans do not necessarily show enhanced mutual cooperation. Here we show that reinforcement learning (specifically, the so-called Bush-Mosteller model) approximately explains the experimentally observed network reciprocity and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Int. J. Systems Science

دوره 32  شماره 

صفحات  -

تاریخ انتشار 2001